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Counting hypergraph matchings up to uniqueness threshold 


Renjie Song* Yitong Yirg Jinman Zhao* 


Abstract 

We study the problem of approximately counting matchings in hypergraphs of bounded 
maximum degree and maximum size of hyperedges. With an activity parameter A, each matching 
M is assigned a weight A^ M t The counting problem is formulated as computing a partition 
function that gives the sum of the weights of all matchings in a hypergraph. This problem unifies 
two extensively studied statistical physics models in approximate counting: the hardcore model 
(graph independent sets) and the monomer-dimer model (graph matchings). 

For this model, the critical activity A c = is the threshold for the uniqueness of 

Gibbs measures on the infinite (d + l)-uniform (k + l)-regular hypertree. Consider hypergraphs 
of maximum degree at most k + 1 and maximum size of hyperedges at most d + 1. We show that 
when A < A c , there is an FPTAS for computing the partition function; and when A = A c , there 
is a PTAS for computing the log-partition function. These algorithms are based on the decay of 
correlation (strong spatial mixing) property of Gibbs distributions. When A > 2A C , there is no 
PRAS for the partition function or the log-partition function unless NP=RP. 

Towards obtaining a sharp transition of computational complexity of approximate counting, 
we study the local convergence from a sequence of finite hypergraphs to the infinite lattice with 
specified symmetry. We show a surprising connection between the local convergence and the 
reversibility of a natural random walk. This leads us to a barrier for the hardness result: The 
non-uniqueness of infinite Gibbs measure is not realizable by any finite gadgets. 


1 Introduction 

Counting problems have long been studied in the context of statistical physics models. Perhaps the 
two most well studied statistical physics models for approximate counting are the hardcore model 
and the monomer-dimer model. 

In the hardcore model, given a graph G = (V, E ) and a vertex-activity A, the model assigns 
each independent set I of G a weight w^(I) = A^l A natural probability distribution, the 
Gibbs distribution, is defined over all independent sets of G as I) = w^(I)/Z^(G) where the 
normalizing factor Z^(G) = X/ w^(I) is the partition function. In the monomer-dimer model, 
given a graph G = (V, E) and an edge-activity A, the model assigns each matching M of G a weight 
u'jy 1 ( M ) = A^ m L The Gibbs distribution over all matchings of G is defined accordingly. And the 
partition function now becomes Zff (G) = Xm^a \M). The counting problems are then formulated 
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as computing the partition functions Z| S (G) and Z^(G), or the log-partition functions logZ^ s (G) 
and log Z ^ (G). 

It was well known that the hardcore model exhibits the following phase transition. For the infinite 
(d + l)-regular tree T^, there is a critical activity A c (Trf) = d d /(d — l) d+1 , called the uniqueness 
threshold, such that when A < A c the correlation between the marginal distribution at the root 
and any boundary condition on leaves at level t decays exponentially in the depth t, but when 
A > A c the boundary-to-root correlation remains substantial even as t —> oo. This property of 
correlation decay is also called spatial mixing, and was known to be equivalent to the uniqueness of 
the inhnite-volume Gibbs measure on the infinite (d + l)-regular tree [33]. In a seminal work [34], 
Weitz showed that for all A < A C (T^) the decay of correlation holds for the hardcore model on all 
graphs of maximum degree bounded by d + 1 and there is a deterministic FPTAS for approximately 
computing the partition function on all such graphs. Here the specific notion of decay of correlation 
established is the strong spatial mixing. The connection of approximability of partition function to 
the phase transition of the model is further strengthened in a series of works [7,9,30,31] which show 
that unless NP=RP there is no PRAS for the partition function or the log-partition function of the 
hardcore model when A > A c (Td) on graphs with maximum degree bounded by d + 1. 

For the monomer-dimer model, it was well known that the model has no such phase transition [13, 
14]. And analogously there is an FPRAS due to Jerrurn and Sinclair [16] for the partition function 
of the monomer-dimer model on all graphs. In [1] strong spatial mixing with an exponential rate 
was established for the model on all graphs with maximum degree bounded by an arbitrary constant 
and a deterministic FPTAS was also given for the partition function on all such graphs. 

In this paper, we study hypergraph matchings, a model that unifies both the hardcore model and 
the monomer-dimer model. A hypergraph hi = (V, E ) consists of a vertex set V and a collection E 
of vertex subsets, called the (hyper)edges. A matching of hi is a set M C E of disjoint hyperedges 
in 'll. Given a hypergraph hi and an activity parameter A > 0, a configuration is a matching M of 
hi, and is assigned a weight w\(M) = AThe Gibbs measure over all matchings of hi is defined 
as /i(M) = vj\(M)/Z\( hi), where the normalizing factor Z\(hi) is the partition function for the 
model, defined as: 

z x {hi) = y A ' M| - 

M : matching of 'H 

This model represents an interesting subclass of Boolean CSP defined by the matching (packing) 
constraints. It also unifies the hardcore model and the monomer-dimer model. Consider the family 
of hypergraphs of maximum edge size d+l and maximum degree k + 1: 

• When d = 1, the model becomes the monomer-dimer model on graphs of maximum degree 

k + 1. 

• When k = 1, the partition function takes sum over independent sets in the dual graph, and 
the model becomes the hardcore model on graphs of maximum degree d+l. 

For hypergraphs, the study of approximate counting hypergraph matchings was initiated in [17]. 
In [5] , an FPTAS was obtained for counting matchings in 3-uniform hypergraphs of maximum degree 
at most 3 by considering the correlation decay for the independent sets in claw-free graphs. In [22], 
an FPTAS was given for 3-uniform hypergraphs of maximum degree at most 4 by the correlation 
decay of the original CSP. All these results assumed A = 1, i.e. the problem of counting the number 
of matchings in a hypergraph. 
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Our results. We show that for hypergraph matchings A c = \ c (Td,k) = knzfiy+i is the uniqueness 
threshold on the infinite (d + l)-uniform ( k + l)-regular hypertree T^fc. 

Proposition 1.1. There is a unique Gibbs measure on matchings of if and only if A < A c . 

This fact was implicit in the literature. Here we give a formal proof. It subsumes the well-known 
uniqueness threshold A c (T^ j i) = ^ d _ d ^d+i for the hardcore model on the infinite (d + l)-regular tree 
and also the lack of phase-transition for the monomer-dimer model. 

We then establish the decay of correlation for hypergraph matchings on all hypergraphs with 
bounded maximum size of hyperedges and bounded maximum degree when the activity A is in the 
uniqueness regime for the uniform regular hypertree. The specific notion of decay of correlations that 
we establish here is the strong spatial mixing [34] (see Section 2 for a formal definition). Consequently, 
we give an FPTAS for the partition function when A is in the interior of the uniqueness regime, and 
a PTAS for the log-partition function when A is at the critical threshold. 

Theorem 1.2. For every finite integers d,k > 1, the following holds for matchings with activity A 
on all hypergraphs of maximum edge-size at most d + 1 and maximum degree at most k + 1: 

• if A < A c , the model exhibits strong spatial mixing at an exponential rate and there exists an 
FPTAS for computing the partition function; 

• if A = A c , the model exhibits strong spatial mixing at a polynomial rate and there is a PTAS 
for computing the log-partition function. 

Remark. The theorem unifies the strong spatial mixing and FPTAS for the hardcore model [34] and 
the monomer-dimer model [1], and also covers as special cases the results for approximate counting 
non-weighted hypergraph matchings in [5,17,22]. 

For hypergraph matchings, the case of critical threshold is of significance. There is a natural 
combinatorial problem that corresponds to the threshold case: counting matchings in 3-uniform 
hypergraphs of maximum degree at most 5. Here d = 2, k = 4, and the critical A c = ~j^m~p)S +1 = 1; 
which corresponds to counting the number of hypergraph matchings without weight. 

Unlike most recent correlation-decay-based algorithms, where the strong spatial mixings were 
established by a potential analysis, we do not use the potential method to analyze the decay of 
correlation. Instead, we prove the following stronger extremal statement. 

Proposition 1.3. For hypergraph matchings, the worst case of (weak or strong) spatial mixing, 
in terms of decay rate, among all hypergraphs of maximum edge-size at most d + 1 and maximum 
degree at most k + 1, is represented by the weak spatial mixing on T d k- 

We construct a hypergraph version of Weitz’s self-avoiding walk tree. Then we show that 
weak spatial mixing on the uniform regular hypertree implies strong spatial mixing on all smaller 
hypertrees by a step-by-step comparison of correlation decay. This was the original approach used by 
Weitz for the hardcore model [34]. Compared to the more recent potential method [19,20,22,27-29], 
this method of analyzing the decay of correlation has the advantage in dealing with the critical case. 

On the other hand, due to a simple reduction from the inapproximability of the hardcore model 
in the non-uniqueness regime [31], we have the following hardness result. 


3 





k 



Figure 1: The classification of computational complexity of approximately counting matchings in hypergraphs 
of max-degree (k + 1) and max-edge-size (d + 1) when A = 1. The blue curve is the uniqueness threshold. The 
non-continuity of the red curve is due to rounding. 

Theorem 1.4. If A > -——A c « 2X C , then there is no PR AS for the partition function or the 

log-partition function for the family of hypergraphs stated in Theorem 1.2, unless NP=RP. 

Figure 1 illustrates the classification of approximability of counting hypergraph matchings when 
A = 1. Each integral point (d, k ) corresponds to the problem of approximately counting matchings 
in hypergraphs of max-degree (k + 1) and max-edge-size (d+ 1). The landscape will continuously 
change when A changes. 

It is worth noticing that in our reduction the hard instances contain many small cycles, while 
from the algorithmic side the worst cases for the decay of correlation are trees. This obvious 
inconsistency between upper and lower bounds and the ad hoc nature of the simple reduction seem 
to suggest that the current hardness threshold is not optimal. 

We then explore the possibility of bringing the current hardness threshold from ~ 2A C down 
to the phase-transition threshold A c . We discover a reason why getting the exact transition of 
approximability could be so challenging for this model on hypergraphs. 

To state our discovery, let us first review the current approach for establishing computational 
phase transition for approximate counting [6,7,9,10,26,30,31], which consists of two main steps: 

• (from all infinite measures to finitely many infinite measures) The uniqueness threshold A C (T^) 
for the Gibbs measure on the infinite regular tree is achieved by a sub-family of Gibbs 
measures with simple structure: the Gibbs measures that are invariant under a group G 
of automorphisms on T^. For the hardcore model, these are the so-called semi-translation 
invariant Gibbs measures, which are invariant under parity-preserving automorphisms on T^, 
and the threshold A C (T^) for the uniqueness of all Gibbs measures on is the same as the 
threshold A C (T^) for the uniqueness of only those Gibbs measures that are invariant under 
the group G of parity-preserving automorphisms. 
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• (from finitely many infinite measures to finite measures) A sequence of (possibly random) finite 
graphs G n is constructed to converge locally to T®, the infinite tree T c ; equipped with the 
symmetry specified by group G. For the hardcore model, and more generally antiferromagnetic 
spin systems, G n are the random regular bipartite graphs [6,7,9,10,26,30,31], which converge 
locally to the infinite tree respecting the symmetry between vertices of the same parity. 
The “random” and “regular” parts in this construction guarantee to preserve the local tree 
structure in distribution, while the bipartiteness respects the parity of vertices. 

For the model of hypergraph matchings, the first step follows. We show that there indeed is a 
group G of automorphisms on the infinite (d + l)-uniform (k + l)-regular hypertree T^fc such that 
^c0^d,k) = ^c(ir^ fc ), i.e. the uniqueness of Gibbs measure on T^k is represented precisely by the 
uniqueness of only those Gibbs measures invariant under G. This gives a natural generalization of 
semi-translation Gibbs measures to the hypergraph model. 

However, we show that there does not exist any sequence of (deterministic or random) finite 
hypergraphs that converge locally to T® fc unless k = 1 where the model degenerates to the hardcore 
model on graphs. In fact, we give a complete characterization of the symmetry described by a group 
G of automorphisms on Y^k that there exists a sequence of finite hypergraphs that converge locally 
t° T d,fc- 

Theorem 1.5. Let G be a group of automorphisms on T d,k with finitely many orbits. There exist a 
sequence of random finite hypergraphs Ti n that converge locally to T®, if and only if the uniform 
random walk on T^k projected onto the orbits of G is reversible. 

See Theorem 7.1 and its proof for more details of Theorem 1.5. 

Discussion. To summarize our discoveries for the model of hypergraph matchings: 

• Theorem 1.2 implicitly but rigorously shows that the worst case for the decay of correlation 
among a family of hypergraphs with bounded maximum degree and bounded maximum 
edge-size, is achieved by the infinite uniform regular hypertree. 

• However, in the current inapproximability stated by Theorem 1.4, the hard instances are not 
locally tree-like, but rather, the gadgets locally converge to an infinite hypergraph which is 
not a hypertree (see Section 6). 

• And finally, Theorem 1.5 gives an explanation of this inconsistence between upper and lower 
bounds: the extremal case for the decay of correlation in Theorem 1.2, which is achieved by 
an infinite-hypertree measure, can never be realized by any finite hypergraphs. 1 

Altogether, these discoveries deliver the following very interesting message: In order to establish a 
sharp connection between computational complexity of approximate counting and phase transitions 
for hypergraph matchings or other more general models, a more fine-grained definition of uniqueness 
on finite graphs is necessary. 

1 In fact, aided by numerical simulations, so far we have not encountered any family of measures on the infinite 
uniform regular hypertree T a,k realizable by finite hypergraphs, whose uniqueness threshold is below 2A C . This seems 
to provide some empirical evidence for that on finite hypergraphs, the worst case for uniqueness might not be locally 
tree-like. 
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Remark on exposition. For convenience of visualizing the results, all our results in the rest of 
the paper are presented for independent sets in the dual hypergraphs. Note that matchings are 
equivalent to independent sets under hypergraph duality. The only effect of duality on a family 
of hypergraphs with bounded maximum edge size and bounded maximum degree is to switch the 
bounds on the edge size and the degree. We emphasize that our notion of hypergraph independent 
set is different from the more popular definition used in [2,3]. We call a vertex subset / C b in 
a hypergraph E = (V, E ) an independent set if no two vertices in / are contained in the same 
hyperedge, while in [2,3], an I C V is an independent set if it does not contain any hyperedge as 
subset. 

Related works. Approximate counting of hypergraph matchings was studied in [17] for hy¬ 
pergraphs with restrictive structures, and in [5,22] for hypergraphs with bounded edge size and 
maximum degree. In [3,24], approximate counting of a variant of hypergraph independent sets was 
studied, where the definition of hypergraph independent set is different from ours. In a very recent 
breakthrough [2], FPTAS for this problem is obtained when there is no strong spatial mixing. In [8], 
the hardness is established for a class of hypergraph models including ours. 

The spatial mixing (decay of correlation) is already a widely studied topic in Computer Science, 
because it may support FPTAS for #P-hard counting problems. The decay of correlation was 
established via the self-avoiding walk tree for the hardcore model [29,34], monomer-dimer model [1,28], 
and two-spin systems [19,20,28]. Similar tree-structured recursions were employed to prove the 
decay of correlation for multi-spin systems [11,12,25] and more general CSPs [21-23]. 

2 Preliminaries 

For a hypergraph E = (V,E), the size of a hyperedge e £ E is its cardinality |e|, and the degree 
of a vertex v G V, denoted by degu = deg-^(u), is the number of hyperedges e£fi incident to v, 
i.e. satisfying v G e. A hypergraph E is k-uniform if all hyperedges are of the same size k , and is 
d-regular if all vertices have the same degree d. The incidence graph of a hypergraph E = (V, E) is 
a bipartite graph with V and E as vertex sets on the two sides, such that each (v,e) €V x E is a 
bipartite edge if and only if v is incident to e. 

A matching of hypergraph E = (V,E) is a set M C E of disjoint hyperedges in E. Given 
an activity parameter A > 0, the Gibbs measure is a probability distribution over matchings of 
E proportional to the weight w^(M) = Al M l, defined as /i^(M) = w^(M)/Z^{E), where the 
normalizing factor Z^(E) = \M) is the partition function. 

Similarly, an independent set of hypergraph E = (17, E) is a set I C V of vertices satisfying 
\I n e| < 1 for all hyperedges e in E. The Gibbs measure over independent sets of E with activity 
A > 0 is given by 

JS/n = w a( J ) = A ' J| (1) 

J Zf(E) ZfiE)' 1 ’ 

where the normalizing factor Z^(E) = Yli w a S (- 0 the partition function for independent sets of 

E with activity A. 

Independent sets and matchings are equivalent under hypergraph duality. The dual of a 
hypergraph E = (17, E), denoted by E* = (E*, 17*), is the hypergraph whose vertex set is denoted 
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by E* and edge set is denoted by V*, such that every vertex v G V (and every hyperedge e G E) in 
H is one-to-one corresponding to a hyperedge v* G V* (and a vertex e* G E*), such that e* G v* if 
and only if v G e. Note that under duality, matchings and hypergraphs are the same CSP and hence 
result in the same Gibbs measure, which remains to be true even with activity A. Also a family of 
hypergraphs of bounded maximum edge size and bounded maximum degree is transformed under 
duality to a family of hypergraphs with the bounds on the edge size and degree exchanged. 

Remark 2.1. With the above equivalence under duality, from now on we state all our results in 
terms of the independent sets in the dual hypergraph and omit the superscript - IS in notations. 

Given the Gibbs measure over independent sets of hypergraph 77 and a vertex v, we define the 
marginal probability p v as 

Pv = Pn,v = Pr[u G I} 

which is the probability that v is in an independent set I sampled from the Gibbs measure (such 
a vertex is also said to be occupied). Given a vertex set A C V, a configuration is a o\ G {0,1} A 
which corresponds to an independent set I\ partially specified over A such that cr\(v) indicates 
whether a v G A is occupied by the independent set. We further define the marginal probability 
Ph,v as 

Pv A = Ph,v = Pr b G / I = O-a] 

which is the probability that v is occupied under the Gibbs measure conditioning on the configuration 
of vertices in A C V being fixed as cr\. 

Definition 2.1. The independent sets of a finite hypergraph 77 = (V, E) with activity A > 0 
exhibit weak spatial mixing (WSM) with rate 5 : N — > M + if for any v G V, A C V, and any two 
configurations G {0,1} A which correspond to two independent sets partially specified on A, 

I Pv A ~P r v K I < <*(distft(u,A)), 

where dist%(u, A) is the shortest distance between v and any vertex in A in hypergraph Ti. 

Definition 2.2. The independent sets of a finite hypergraph TL = {V. E) with activity A > 0 
exhibit strong spatial mixing (SSM) with rate 5 : N —> M + if for any v G V, A C V, and any two 
configurations (ta,ta. £ {0,1} A which correspond to two independent sets partially specified on A, 

\Pv A -P T v A \ < <5(dist w (u, A)), 

where A C A stands for the subset on which o\ and t\ differ and dist^(u, A) is the shortest distance 
between v and any vertex in A in hypergraph TL. 

The definitions of WSM and SSM extend to infinite hypergraphs with the same conditions to be 
satisfied for every finite region conditioning on the vertices in <9T being unoccupied. 

3 Gibbs measures on the infinite tree 

We follow Remark 2.1 and state our discoveries in terms of independent sets in the dual hypergraphs. 
Let TTfc d be the infinite {k + l)-uniform (d + l)-regular hypertree, whose incidence graph is the 
infinite tree in which all vertices with parity 0 are of degree (k + 1) and all vertices with parity 1 are 
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of degree (d + 1). A probability measure /i on hypergraph independent sets of is Gibbs if for any 
finite sub-hypertree T, conditioning p upon the event that all vertices on the outer boundary of T 
are unoccupied gives the same distribution on independent sets of T as defined by (1) with Ti = T. 
We further consider the simple Gibbs measures satisfying conditional independence : Conditioning 
p on a configuration of a subset A of vertices results in a measure in which the configurations on 
the components separated by A are independent of each other. The Gibbs distribution on a finite 
hypergraph is always simple. A Gibbs measure on T k,d is translation-invariant if it is invariant under 
all automorphisms of Tj.^- Fix an automorphism group G of T^. A G-translation-invariant Gibbs 
measure on is a measure that is invariant under all automorphisms from G. For example, the 
semi-translation-invariant Gibbs measures on regular tree are invariant under all parity-preserving 
automorphisms on Ti^. The natural group actions of G respectively on vertices and hyperedges 
partition the sets of vertices and hyperedges into orbits. For example, in the semi-translation- 
invariant symmetry on regular tree, vertices with the same parity form an orbit. We will show that 
is the uniqueness threshold for the Gibbs measures on hypergraph independent 
sets of T*. d. Furthermore, this uniqueness threshold is achieved by a family of Gibbs measures with 
simple structure. 

Theorem 3.1. There is always a unique simple translation-invariant Gibbs measure on independent 
sets o/T^. Let \ c = AcCT^) = k y d _ d 1 yi +1 • There is a unique Gibbs measure on if and only if 

A < A c . Furthermore, there is an automorphism group G on T^ which classifies all vertices ofT^ } d 
into 2 orbits, such that the threshold for the uniqueness of G -translation invariant Gibbs measures 
on T m? denoted as A c (T| d ) ; is A c (T| d ) = A C (T M ). 

This proves the uniqueness threshold stated in Proposition 1.1. 

3.1 Branching matrices 

The automorphism group G on T^d can be described conveniently by a notion of branching matrices. 
For an automorphism group G on T*,^, the natural group actions of G respectively on vertices and 
hyperedges partition the sets of vertices and hyperedges into orbits. Let t v and r e be the respective 
numbers of orbits for vertices and hyperedges. For each i £ [r„], we say a vertex is of type-i if it is in 
the i-tli orbit for vertices; and the same also applies to hyperedges. Assuming the symmetry on T^ 
given by automorphism group G, the hypergraph branching matrices , or just branching matrices, are 
the following two nonnegative integral matrices: 

D = D TvXTe = [dij} and K = K T eXT ” = [A#], 

which satisfy that for any i £ [r„] and j £ [r e ]: 

• every vertex in of type-i is incident to precisely dij hyperedges of type-j; 

• every hyperedge in T k,d of type-j contains precisely kji vertices of type-i. 

The D and K are transition matrices from vertex-types to hyperedge-types and vice versa in T^. 
The definition can be seen as a hypergraph generalization of the branching matrix for multi-type 
Galton-Watson tree [27]. Since types (orbits) are invariant under all automorphisms from G, it 
is clear that the above D and K are well-defined for every automorphism group G on with 
finitely many orbits. 




Proposition 3.2. Every automorphism group G on T k,d with finitely many orbits can be identified 
by a pair of branching matrices D and K with rules as described above and satisfy: (1) Yj dij = d+1 
and Yi kji = k + 1; (2) dij = 0 if and only if kji = 0; and (3) DK and KD are irreducible. 

Conversely, any pair of nonnegative integral matrices D and K satisfying these conditions are 
branching matrices for some automorphism group G on T k,d- 


Proof. Let G be an automorphism group on with finitely many orbits. It is trivial to see that 
the branching matrices D and K are well-defined and satisfy Yj dij = d+1 and Yi kji = k + 1. 

A vertex v of type-* is incident to a hyperedge e of type-j if and only if e of type-j contains a 
vertex v of type i, thus kji 0 if and only if dij 0. 


The irreducibility of DK and KD follows that of the matrix 


0 

K 


D 

0 


which is a consequence 


to the that every type of vertex and hyperedge is accessible from all other types of vertices and 
hyperedges, which follows the simple fact that the incidence graph T*. ^ is strongly connected. 

Conversely, let D and K be a pair of nonnegative integral matrices satisfying the conditions 
above. We can start from any vertex (or hyperedges) o of type-* and construct an infinite hypertree 
rooted at o with each vertex and hyperedge labeled with the respective type according to the rules 
specified by the branching matrices D and K. Since djj = 0 if and only if kji = 0, the construction 
is always possible. Since Yj dij = d+1 and Yi kji = k + 1, the resulting infinite hypertree must be 
k- uniform and d- regular. Since DK and KD are irreducible, no matter how we choose the type 
for the root o, the resulting hypertree contains all types of vertices and hyperedges. 

We can then construct an automorphism group G on Taccording with orbits being the types 
just specified. For every pair of vertices (or hyperedges) it, v with the same type, by generating 
the hypertree according to D, K starting from u and v respectively, we obtain an automorphism 
on which maps it to v and preserves the types of all vertices and hyperedges. Let 
G = (I Vu, v with the same type}) be the group generated from all such automorphisms. 
Then D and K are branching matrices for automorphism group G on T*. □ 


3.2 Extremal Gibbs measures 


Consider a special automorphism group G on T k,d defined by the following branching matrices 
(D, K). Assume that there are two vertex-types and two hyperedge-types, both denoted as {+, — 

T d] 


and the branching matrices are defined as D = 


d 1 


and K = 


k 1 
1 k 


i.e.: 


1. every £ =t’-vertex is incident to a ‘±’-hyperedge and d ‘q=’-hyperedges; 


2. every ‘±’-hyperedge contains k ‘±’-vertices and a ‘^’-vertex. 

See Figure 2 for an illustration. Fix a ‘-(-’-vertex v in as the root. Let p. + (resp. p~) be the 
Gibbs measure on +k d defined by conditioning on all vertices to be occupied for the i-th ‘+’-vertices 
(resp. ‘—’-vertices) along all path from the root and taking the weak limit as t —> oo. Note that for 
the 2-coloring given by D and K, on any path any ‘±’-vertex has a ‘^’-vertex within 2 steps, so 
the limiting sequence is well-defined. And by symmetry, starting from a root of type- 1 —’ gives the 
same pair of measures. 

The /Y generalize the extremal semi-translation-invariant Gibbs measures on infinite regular 
trees. For hypertree with k > 2, there are no parity-preserving automorphisms. Nevertheless, 
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Figure 2: Classifying vertices and hyperedges of ¥ 3,2 into two types ‘+’(black) and ‘ — ’(white). The hypergraph is 
represented as its incidence graph where circles stand for vertices and squares stand for hyperedges. 


the symmetry given by D and K generalizes the parity-preserving automorphisms to hypertrees 
and has the similar phase-transition as semi-translation-invariant Gibbs measures on trees. 

The p± are simple and are G-translation-invariant for the automorphism group G with orbits 
given by D and K. In fact, they are extremal G-translation-invariant Gibbs measures on T k,d- We 
will see that the model has uniqueness if and only if p + = p~. 


3.3 Uniqueness of Gibbs measures 

Lemma 3.3. Let p be a simple Gibbs measure on independent sets of T^. Let v be a vertex in 
T h } d and Vij the j-th vertex (besides v) in the i-th hyperedge incident to v, for i = 1,2,..., d + 1 
and j = 1, 2,..., k. Let p v = p[v is occupied ] and p Vij = p[vij is occupied]. It holds that 


d -\-1 

Pv = a(i - p v y d ni 1 -^ -I>, ) • 
i= 1 V 3 =1 


(2) 


Proof. Since p is a Gibbs measure, for any vertex v in T^, it holds that 

p v = p[v is occupied] = --- • //[all the neighbors of v are unoccupied] 

1 T A 

On the other hand, since p is simple, conditioning on the root being unoccupied the sub-hypertrees 
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are independent of each other, thus 


/u[all the neighbors of v are unoccupied] 

=p[v is occupied] • //[all the neighbors of v are unoccupied | v is occupied] 

d+ 1 

+ n[v is unoccupied] //[VI < j < k, v. L j is unoccupied | v is unoccupied] 
i=l 

d -\-1 / k 

=p v + (1 — p v ) 1 — p\ v ij is occupied | v is unoccupied] 
i =1 \ 3 =1 

Note that for any two adjacent vertices v,Vij, we have //[is occupied] = //[ v^j is occupied | 
v is unoccupied] • p[v is unoccupied], thus 

r . . . ,, p.[vij is occupied] p Vij 

H\Vij is occupied v is unoccupied =- - --r = -=—. 

1 — n[v is occupied] 1— p v 

The lemma follows by combining everything together. □ 

Equation (2) gives an infinite system involving all vertices in TIf the simple Gibbs measure 
// is G-translation-invariant for some automorphism group G on the marginal probability 
p v = p[v is occupied] depends only on the type (orbit) of v. 



Corollary 3.4. Let p be a simple G-translation-invariant Gibbs measure on T^ with branching 
matrices D TvXTe = [dij\ and K TeXTv = [kji\. For every i e [t v \, let pi = p[v is occupied} for vertex 
v in T k d of type-i. It holds for every s E [r„] that 


Vs = K l ~Vs) d 

j'e[r e ] 



Applying with the branching matrices D and K defined in Section 3.2, the system in Corollary 3.4 
becomes 

f P+ = A(1 -p+)~ d {l - kp + -p-)(l-p + - kp-) d , 

[p_ = A(1 — p-)~ d ( 1 — kp- — p+)( 1 — p- — kp + ) d . 


Let x = 


kp+ 


1 —p——kp+ 


and y = ■ The system becomes 


y = f(x) 
X = f(y) 


where f(x) = is 


the hardcore tree-recursion. Since f(x) is positive and decreasing in x, it follows that there is a 
unique positive x such that x = f(x), which means there is always a unique simple translation- 
invariant Gibbs measure on Tk,d- It is well-known (see [9] and [18,32]) the system has three 
distinct solutions (x,x), ( x + ,x~ ) and ( x~,x + ) where 0 < x~ < x < x + , when k\ > d d /(d — l) d+1 , 


i.e. A > A c (T m ) = 


cL d 

k(d—l) d+1 ’ 


and the three solutions collide into a unique solution (x, x) when 


A < A c (Tfe iC j), which means there is a unique simple G-translation-invariant Gibbs measure on T/,,^ if 
and only if A < A C (T*. ^). Recall that /jT are simple and are extremal G-translation-invariant Gibbs 
measures, and hence it also holds that // + = pT if and only if A < A c (H\d), therefore, it holds that 
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A c (Tfc ; d) = A C (T ® d ). In particular if A > X c {Tk,d), then pT ^ p,~ and the Gibbs measure on T k,d is 
non-unique. 

To complete the proof of Theorem 3.1, we only need to show the Gibbs measure on T^. ^ is 
unique if A < X c (Tk,d)- This is implied by the weak spatial mixing on T k,d when A < A c , proved 
later in Theorem 5.5. With the weak spatial mixing on T*. the uniqueness of the Gibbs measure is 
implied by a generic equivalence between weak spatial mixing and uniqueness of Gibbs measure (see 
e.g. [33]). 

4 The hypergraph self-avoiding walk tree 

We call a hypergraph a hypertree if its incidence graph has no cycles. Let T = (V, E ) be a rooted 
hypertree with vertex v as its root. We assume that root v is incident to d distinct hyperedges 
ei, e 2 ,..., ed, such that for i = 1, 2,..., d, 

• \ei\ = ki + 1; and 

* ( 'i — {u, Vi\ , Vi2 1 ••• 5 Viki }• 

For 1 < i < d and 1 < j < k t , let Tj be the sub-hypertree rooted at Vij. Recall that all hypertrees 
considered by us satisfy the property that any two hyperedges share at most one common vertex, 
thus all are distinct and the sub-hypertrees Tij are disjoint. 

Let AcL Let € {0,1} A be a configuration indicating an independent set partially specified 
on vertex set A, and for each 1 < i < d and 1 < j < ki, let a\\ ):j be the restriction of cta on the 
sub-hypertree Tij. Consider the ratios of marginal probabilities: 


«?=Kt K ,J(}-Vt\) “d ~ PtX) ■ 

The following recursion can be easily verified due to the disjointness between sub-hypertrees: 


r> a A 

iL'j- 




<7 A. 


“ 1 + Eli Hr'" 


(3) 


This is the “tree recursion” for hypergraph independent sets. The tree recursions for the hardcore 
model [34] and the monomer-dimer model [1] can both be interpreted as special cases. 

For general hypergraphs which are not trees, we construct a hypergraph version of self-avoiding- 
walk tree, which allows computing marginal probabilities in arbitrary hypergraphs with the tree 
recursion. Moreover, we show that the uniform regular hypertree is the worst case for SSM among 
all hypergraphs of bounded maximum edge-size and bounded maximum degree. 


Theorem 4.1. For any positive integers k, d and any positive X, if the independent sets ofTk^d with 
activity X exhibit strong spatial mixing with rate 5(-), then the independent sets of any hypergraph of 
maximum edge size at most (k + 1) and maximum degree at most (d + 1), with activity A, exhibit 
strong spatial mixing with the same rate <5(-). 
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Under duality, the same holds for the hypergraph matchings. 

We then define the hypergraph self-avoiding walk tree. A walk in a hypergraph 7~L = (V, E ) is a 
sequence (vo, ei, v\,..., eg, vg) of alternating vertices and hyperedges such that every two consecutive 
vertices v t -i, V{ are incident to the hyperedge e* between them. A walk w = (vo, e\, v\,..., eg, vg) is 
called self-avoiding if: 

• w = (vo,ei,vi,eg,vg) forms a simple path in the incidence graph of 'H\ and 

• for every i = 1, 2,... ,£, vertex Vi is incident to none of {ei, e 2 ,..., ej_i}. 

Note that the second requirement is new to the hypergraphs. 

A self-avoiding walk w = {yo,ei,v\,... ,eg,vg) can be extended to a cycle-closing walk w’ = 
(t>o, ei, v \,..., eg, vg, e!, v') so that the suffix (vi, e^+i, Uj+i,..., eg, vg, e', v 1 ), for some 0 < i < £ — 1, 
of the walk forms a simple cycle in the incidence graph of FL. We call v' the cycle-closing vertex. 

Given a hypergraph FL = (V,E), an ordering of incident hyperedges at every vertex can be 
arbitrarily fixed, so that for any two hyperedges ei,e 2 incident to a vertex u we use ei < u e 2 to 
denote that e\ is ranked higher than e 2 according to the ordering of hyperedges incident to u. With 
this local ordering of hyperedges, given any vertex v G V, a rooted hypertree T = 7 saw(^ 5 ^)) 
called the self-avoiding walk (SAW) tree , is constructed as follows: 

1. Every vertex of T corresponds to a distinct self-avoiding walk in FL originating from v, where 
the root corresponds to the trivial walk (v). 

2. For any vertex u in T, which corresponds to a self-avoiding walk w = (v, e\, v\ ..., eg, vg), we 
partition all self-avoiding walks w' = (v, e\, v\ ..., eg, vg, e ', v') in % which extends w, into sets 
according to which hyperedge they use to extend the original walk w, so that self-avoiding 
walks within the same sets extends w with the same hyperedge e!. For every set, we create a 
distinct hyperedge in T incident to u which contains the children of u corresponding to the 
self-avoiding walks within that set. 

3. We further modify the hypertree T obtained from the above two steps according to how cycles 
are closed. For any vertex u in T corresponding to a self-avoiding walk w = {v,e\,v\ ..., eg, vg) 
which can be extended to a cycle-closing walk w' = (v, e\, v\ ..., eg, vg, e', v') such that v' G 
{v, v\,..., vg- 1 }, denoted by e" the hyperedge in w starting that cycle, if it holds that e' <„/ e", 
i.e. the hyperedge ending the cycle is ranked higher than the hyperedge starting the cycle by 
the cycle-closing vertex, then vertex u along with all its descendants in T are deleted from T. 
Any hyperedges whose size becomes 1 because of this step are also deleted from T. 

The construction is illustrated in Figure 3. 

We consider the Gibbs measure of a rooted hypertree T with activity A, and use P^- A to denote 
the marginal probability of the root of T being occupied conditioning on a\. 

Note that each vertex u in 7 sawC^i^) can be naturally identified (many-to-one) to the vertex 
in % = {V, E) at which the self-avoiding walk corresponding to u ends, thus a configuration a a 
partially specified on a subset A C V of vertices in H can be directly translated to a partially 
specified configuration in 7 saw(^)^) through the one-to-many association. We abuse the notation 
and still denote the resulting configuration in T = 7saw(%>^) as <ta, thus is well-defined. 

Theorem 4.2. Let T~i = {V, E) be a hypergraph and A > 0. For any v G V, A C V and G {0,1} A , 
it holds that py( v = P^- A where T = 7sawC kL,v). 
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Figure 3: The construction of 7saw- On the left is a hypergraph T-L and on the right is 7saw(H, i>i ), both drawn as 
incident graphs. The ordering of the hyperedges incident to each vertex in H is given by the subscripts. Each vertex 
or hyperedge in 7saw is labeled by the name of the vertex or hyperedge to which it is identified in T-L. Dashed vertices 
are the ones deleted according to the ordering of incident hyperedges at the cycle-closing vertices. Dashed hyperedge 
is deleted because its size becomes 1. 


Proof. The proof follows the same routine as that of Weitz [34] , with some extra cares to be taken 
to avoid the complications caused by hypergraphs. 

Denote R^f v (X) = Py L v /( 1 — P^f v ) for the ratio between the probability that v in PL is occupied 
and unoccupied conditioning on configuration of A C V. We write Rfy = Rj^ v when v is 
unambiguously the root of T. 

Let d be the degree of the root of T. Suppose that there are ki children contained in i-th 
child-edge, where the order is determined during the construction of 7 saw(%it). %j is the subtree 
rooted at the j-th child in the i-th child-edge. Let A t j = A n %j and cta,-, be the restriction of 
on A ij. Applying the tree recursion (3) for the self-avoiding walk tree T, we have 


p°\A 

Ih'j- 




=1 1 + UU R n 


(4) 


This defines a recursive procedure for calculating Rj>. The base cases are naturally defined when v 
lies in A, in which case Rj^ = 0 if v is fixed unoccupied or RfjX = oo if it is fixed occupied, or when 
v has no child, in which case RfjX = \. 

In the following we describe our procedure for calculating R%£ v at v in the original hypergraph 
PL. The problem comes that the ratio at different neighbors of v may still depend on each other 
when we fix the value at v since there may exist cycles in PL. We resolve this problem by editing the 
original hypergraph around v and imposing appropriate conditions for each neighbor of v. 

Let PL V be the same hypergraph as PL except that vertex v £ V is substituted by d vertices 
V\,V 2 , ...,Vd, where d is the degree of v. Each vertex V{ is contained into a single hyperedge ej, 
where e* is the i-th hyperedge connecting v, and the order here is the same as the one determined 
in the definition of 7 saw (% 5 v). At the same time, we associated each Vi with an activity of A 1//d 
rather than A. It is now clear to see that an independent set in PL with v occupied has the same 
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weight as the corresponding independent set in R v with all the v l occupied, and so is the case when 
v is unoccupied. Therefore, R%£ v equals to the ratio between the probabilities in R v with all Vi 
(1 < i < d) being occupied and unoccupied, conditioning on a\. Let r* be the configuration for 
vertex Vi in which the values of Vj are fixed to occupied if j < i and unoccupied if j > i. We can 
then write this in a form of telescopic product: 

d 

L> a A _ I I p ( 7 A r i 

— LL n H v ,v i ’ 
i—1 

where cr\Ti means the combination of the two configurations cr\ and Tj. 

We can obtain the value of RZfJl, by further fix vertices in e*, the hyperedge containing vy Since 
now Vi is contained only in e,;, we can see that 

\ 1 / d 

TjfTATi _ _”_ 

J rl v ,v i 1 p ff A npij > 

+ A,j=l 

where ki is the number of the vertices other than Vi which is incident to e* and p t j is the configuration 
at vertices of e, in which all the vertices Uij> other than uij are fixed to unoccupied. 

Combining above two equations, we get a recursive procedure for calculating R%£ v in the same 
manner that equation (4) has: 


R H,v ~ X II , , spki jfKTiPa ■ ( 5 ) 

Notice that the recursion does terminate, since the number of unfixed vertices reduces at least by 
one in each step because in calculating all copies vy of v is either fixed (when i' ^ i) or 

ri j Vi j Uij v 7 

erased (when i' = i) from the hypergraph R v /vi. 

We now show that the procedure described above for calculating R^f v results in the same value 
as using the hypertree procedure for Tsasn{R,v) with corresponding condition of cta imposed on 
it. First notice that the calculation carried out by the two procedure is the same, since they share 
the same function (Equation (4) and (5)) when we view them as recursive calls. Furthermore, we 
have the same stopping values for the both recursive procedures. During constructing 7 saw(^j^) 5 
if node u corresponding to walk is not included in the hypertree, which is equivalent to fix u to 
unoccupied in the sense of causing the same effect on the ratio of occupation to its parent node. 
And when node u in the hypertree corresponding to a self-avoiding walk w = (v, e\, v \..., ee, ve), 
with that w can be extended as w' = (w,eg + to a cycle-closing vertex v^ + i = Vi for some 
0 < i < i via a new hyperedge e^+i 0 {eo, ei,..., e^}, and ee + \ < Vi ej, then the node u along with 
all its descendants are deleted. This gives the equivalent effect to parent node of u as if u is fixed to 
unoccupied, or one of the children of u (i.e. the node corresponding to w') to occupied, which is 
what we did to fix the vertices Vj for j < i in r,. Eliminating a hyperedge with no child also does 
not affect the final value of Rj L 

Thus, what is left to complete the proof is to show that the hypertree Tsaw(R v /d> Uij) with 
( PATipij )’s corresponding condition imposed on it is exactly the same as the subtree of Tsaw(Riv) 
rooted at the j-th child vertex of the z-th child-edge of the root with cta’s corresponding condition 
imposed on it. This is enough because then the resulting values are the same for both procedures 
by induction. The observation is that both trees are the hypertree of all self-avoiding walks in R 
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starting at Uij , except that TsAwikL 1 ’/vi, u ij) has some extra vertices which are fixed to be occupied 
or unoccupied depending on whether the corresponding walk reaches v via a higher or lower ranked 
hyperedge, or reaches i- th hyperedge of v, which results in the same probability of occupation at 
the root. □ 

A hypergraph TL is a sub-hypergraph of another hypergraph Q if the incidence graph of TL is 
a subgraph of that of Q, and for hypertrees this is samely defined. Note that for hypergraphs, 
a subgraph is not necessarily formed by a sub-collection of hyperedges, but maybe also by sub¬ 
hyperedges. The 7saw °f a hypergraph TL with maximum edge-size at most k + 1 and maximum 
degree at most d + 1 is sub-hypertree of T*.^. 

Proposition 4.3. Let To = (Vo,Eo) be a rooted hypertree and T = (V, E) its sub-hypertree with the 
same root. For any A C V and any a a £ {0,1} A , there exists a configuration ga 0 £ {0,1} A ° for 
A C Ao C Vo, extending the configuration a a, such that P^- A = P^ 0 . 

The configuration (Ta 0 just extends a a by fixing all the vertices missing in T (actually only those 
who are closest to the root along each path) to be unoccupied. 

Theorem 4.1 follows immediately from Theorem 4.2 and Proposition 4.3. 

Proof of Theorem f.l. Given any hypergraph TL of maximum edge-size at most (fc + 1) and maximum 
degree at most (d + 1), by Theorem 4.2 we have | p^f v — p!fi J = |P^- A — P^| where T = TsAwifiL, v ). 
The distance from the root v to any vertex u in T is no shorter than the distance TL between v 
and the vertex in TL to which u is identified. So the SSM with rate 5{-) on T implies that on the 
hypergraph TL. 

Since TL has maximum edge-size at most k + 1 and maximum degree at most d + 1, its SAW-tree 
T = TsawCH,v) is a sub-hypertree of T^. Thus by Proposition 4.3, we have |P^ — P^| = 
|P T *° d — P T ^°J for some ga 0 ,ta 0 extending ga,ta- The SSM on with rate £(•) implies that on 
T, which implies the same on the original hypergraph TL. □ 


5 Strong spatial mixing 


In this section, we show that for independent sets of the infinite (k + l)-uniform (d + l)-regular 
hypertree T Tk,d> weak spatial mixing implies strong spatial mixing at almost the same rate. 


Theorem 5.1. For every positive integers d,k and any X, if the independent sets of the infinite 
(k + 1 )-uniform (d+ 1 )-regular hypertree T^ with activity A exhibits weak spatial mixing with rate 

S(-) then it also exhibits strong spatial mixing with rate ( 1+A K A+ ( 1+fcA ) - 


By Theorem 4.1, this implies the strong spatial mixing with the same rate on all hypergraphs of 
maximum degree at most d + 1 and maximum size of hyperedges at most k + 1. 

Unlike most known strong spatial mixing results, where the spatial mixing is usually established 
by an analytic approach with help of potential functions, our proof of Theorem 5.1 adopts the 
combinatorial argument used in Weitz’s original proof of SSM for the hardcore model [34]. Weitz’s 
approach gives us a stronger result: It explicitly gives the extremal case for WSM as well as SSM 
among a family of hypergraphs with bounded maximum degree and bounded maximum edge-size. 
It can also easily give us the SSM behavior when at the critical threshold. 
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Assume the hypertree T = T^ is rooted at some vertex v. For £ > 0, let Rif and Rf denote 
the respective maximum and minimum values of Rf achieved by a boundary condition a that fixes 
the states of all vertices at level £. By the monotonicity of the tree recursion, it is easy to see that 
Rif (or Rf) is computed by the tree recursion with initial values at all vertices at level £ to be oo 
(or 0) if £ is even, and 0 (or oo) if £ is odd, with the root v being at level 0. 2 

It is easy to see that fixing a vertex u in T to be occupied has the same effect as fixing ids 
parent to be unoccupied, therefore to prove SSM, it is sufficient to prove the decay of correlation 
conditioning on a subset of vertices in T fixed to be unoccupied. Another key observation from the 
tree recursion is that fixing a vertex u in T to be unoccupied has the same effect as having a local 
activity X u = 0 at vertex u. Now consider a vector A that assigns every vertex u in T = T*. ^ a local 
activity X u . Let Rf( A) and Rf (A) be accordingly defined as the respective extremal values of Rf(X) 
achieved by boundary conditions a fixing all vertices at level £ in the tree T = Tk d equipped with 
the nonuniform activities A. Clearly, by the same monotonicity, Rf( A) can be computed from the 
tree recursion with a nonuniform activities A with the same settings of initial values as the uniform 
case Rf = Rf( A). 

The following theorem shows that basically the decay of correlation is dominated by the uniform 
activity case. 

Theorem 5.2. Fix an arbitrary A > 0. Let X be an assignment of activities to vertices of such 
that 0 < X v < X for every v G For every £ > 1 we have 

RjW Rj 

R ,! X) n, 

Translated to the language of subtrees, the theorem means that the extremal case of WSM 
among a family hypertrees with bounded maximum degree and maximum edge-size, is given by the 
uniform regular tree with the highest degree and edge-size in the family. Technically, Theorem 5.2 
measures the decay of correlation in terms of log R = log . Note that for £ > 2, it always holds 

that pf( A) < and pf(X) > A+ ^ 1+ ^, A ^d + i , where Rf = 1 . Theorem 5.2 implies Theorem 5.1. 

We now consider a slightly different hypertree which is exactly the same as T*. y except that the 
degree of root is d. Denote this hypertree as T k,d- 

Lemma 5.3. For every integer £ > 1 and any assignment of activities X to vertices of such 
that 0 < X v < X for every vertex v, the following two inequalities hold: 

RjW ^ Rj 

Rf(X) ~ Rf' 

1 + kRj{X) ^ 1 + kRj 
l + kR-{X) ~ 1 + kRf ’ 

with the convention 0/0 = 1 and oo = oo. 

2 Note that although the all-oo initial values corresponds to a boundary condition cr that fixes all vertices at level (. 
to be occupied, which may no longer be a valid independent set in the hypertree, the Rf achieved by this choice of 
initial values is actually the same as the Rf with a boundary condition a that fixes exactly one vertex per hyperedge 
to be occupied at level i. 


( 6 ) 

(7) 
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Proof of Lemma 5.3. The proof is by an induction on t. The proof is similar to that of Weitz [34] 
except for the parts dealing with hyperedges. 

First consider the exceptional cases when the denominators in (6) may be zero. Assume 
Rf( A) = Rf (X) = 0, which only happens when the activity of the root is zero. We adopt the 

= 1. Assume Rf = 0, which only occurs when £ = 1. Then Rf( A) = 0 


convention that 


RJW 

RJ{ A) 


R+(A) 


RJ 


also holds, and by convention we have 1 U = —L = oo. Note that these conventions are 

Rg (A) Rg 

consistent with our induction, such that assuming the induction hypothesis f° r an Y 

R e (A) K t (A) 

k assignments of activities 0 < Ai, A 2 ,..., A& < A, there exists a > 0 such that Rf(Xf) = akRf 
and Yli=i Rf (A,) < akRf. 

For the basis, 1=1. We have Rf (X) > Rf = 0, Rf = A, and Rf( A) = A r where A r is the 
activity of the root. The hypotheses (6) and (7) are true since A r < A. 

Assume (6) and (7) are true for an £ > 1. We will show that they are true for £ + 1. The 
following recursion holds 


R*AX) 


01 


1 + Ej=i Re (A 


•y; 


rr Epltl + ^fe)) 

^+i(A) fJi 1 + E J=1 Re (Ay) i=i Ej-=i(l + kRi (A ij)) 


where A^ stands for the restriction of the assignment A to the subtree of T^ rooted at the j-th child 

l-\-kR+(\-■) l-\-kR+ 

in the z-th edge incident to the root. By induction hypothesis (7), we have — ' A < -——A, so 

(A ij ) 1+AC/t^ 

immediately, 


Rt+ id) A El = i(i + Mtfe)) ^ n + kRf \ d r } +1 

Ri +1 3) + “ \n-kRj) RJ + 1 ’ 

a 

where the inequality is due to the simple fact that if cii > bi > 0 and ff < t for all i, then fiir 1 f < t. 
This proves (6) for £ + 1. Next we will prove (7). Recall the tree recursions: 


R ± 


■e+i 


(A) = A r n 


d 


and 


1 


' 1 1 + E Rf (Xij) 


Rf, 1 = A TT — ^ , 

,+1 M 1 + kRj 


j =1 


where A r is the local activity assigned by A to the root r. Observe that if Ej=i Re (A ij) > 
for all i E [d], then i?^_ 1 (A) < R^ ) _ 1 , which combined with (6) for £ + 1 that we just proved above, 

would give us that + , < , + , pl +1 . In this good case, the hypothesis (7) easily holds for £ + 1. 

We then show that the opposite case where Ej=i Rf (Xij) < for all i represents the worst 
possible case, and it is enough to prove the hypothesis (7) under this condition. To see this, assume 
to the contrary that for some *o, Ej=i^7(A*oi) > kRf. We then construct a X' that satisfies 
Y?j =l Rf(X'i oj ) < kRf and has an even worse ratio between 1 + kRf +1 and 1 + kRf +v Let X be 
the same as A except that for every j E [k], X lQj is uniform and is equal to A everywhere. Clearly, it 
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holds that Y^j= 1 R e (K, 
have 1+kR t_^Joj) < 

1+fcR^ (A.^q j) l+/c_R^> 


1 +kR+ 


= fci? £ . On the other hand, by the induction hypothesis, for every j we 
, and hence 


i + Ej-i RfiKi) _ gjU(i + fci?j~(A i0J -)) ^ 1 + kRj 
1 + E?=i^-( 4,0 Ej=i(l + kRj(Xi 0j )) - 1 + kRJ* 


E" 


< t. 


where again the inequality uses the fact that if a* > b{ > 0 and < t for all i, then ™ 

4 Z_/i=l * 

Note that X' only changes the activities of all the subtrees rooted by the the children in the io-th 
edge of the root. So we have 


R t i(A) yr 1 + EjLl Rt&i) ^ Rj+lW 

R e+ifi) i =11 + Ej=i R t &ij) R e+ifi) 

<L i ^ 

and Rj +l (A) = A r JJ- - -— < Rf +1 (\'). 

* =1 1 + E R i (E?) 

3 =1 


Combine the two inequalities, we have 1+kR ^+iW < 1+Aj ! + J ^^ an even worse case. So for the rest 
1 ’ l+kRJ +1 (X) ~ l+kRJ +1 (\')’ 

we only need to consider the case in which for every i, Ej=i R J Xij) — kRj. 

For every 1 < i < d, we can choose 0 < < 1 so that Ej=i = otikRj . Fix i and by 

the induction hypothesis, for every 1 < j < k we have If all i?7(A„j equal zero, then 

Rg (Xij) tig 

Ej=i R t (Ej) — a i.kR.f trivially holds as we argued in the beginning. Otherwise, note that since 
not all RJ(Xij) are zero, we must have £ > 1, so if Rj{Xij) = 0 then R+ (Xij) = 0. Thus we also 

have S =1 < ^=- In conclusion, in both cases we have EEi R t (Ej) < otikR^. 

z^j =i Rg (*ij) Rg j ■ ' 

Observe that A r < A and flEi + 5- =1 — 1- it holds that 


1 + kR+ (A) 1 + kX r nil 


!+Ej=i R e (Ej) 


1 + kX n*=i 


i 


1 + kRg i 1 (A) 1 + k\ r Ui=i 


< 


1+a.ikR, 


1+E •=! «/(E) 


i + an != i 


1 


1+otikRf 


Now it is enough to show that for every a such that 0 < < 1 for all 1 < i < d, it holds that 

i+kx nt, 1 


l+CKi/ci?, 


1 + kX Yl i=1 


1 

1+ctikRj 


< 


1 + kRt 


t+i 


1 + kR : 


t +1 


which is equivalent to the following: 


+ kx n i 

i= 1 


1 


1 + kRt 


t +i 


+ (XikR f 1 + kR 


-kX 


1 + kRt 


t +1 


t +1 


n 


1 + ^1 + 


< 0. 


( 8 ) 
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If CKi = 1 for every i then the inequality (8) trivially holds. By symmetry, we only need to show the 
LHS of (8) is increasing in a\. In fact, the partial derivative with respect to a.\ of LHS in (8) is: 

k 2 XRJ -4 1 k 2 X(l + kR+ +1 )R+ -jl 1 

1 + a\kRj ~~ 1 + aikRj (l + kRj +1 ) (l + a\kR ^) 1 + a.ikR+ 

To prove it is nonnegative, it is equivalent to show that 

(l + kR^ + ^j R > 1 + ot\kRj -i—r 1 + a t kR,^ 

(l + kRJ + 1) RJ ~ 1 + a±kRJ ^ 1 + atikRJ 

To prove (9), we first observe that RJ is increasing in i and R^ is decreasing in £, which is exactly 
the same to prove as the same property of the hardcore model. This gives us the so-called sandwich 
condition: 

Re — R-e+i — R~e+ i — Re ■ 

R+ R+ 

therefore —4- > 1 +1 . We are now ready to prove (9): 

Rz ^+1 

(1 + kR+ +1 )H+ 

(l + kR i+1 ) R e 


The last inequality uses the fact that Rkjk > 
proof of Lemma 5.3. 

Observe that the subtree rooted at the child of the root of T ^ is isomorphic to Tk,d- While at 
the root of T*, d, we have 

B}( A) = ft 1 + E^-Stlfe) < A + M+V + 1 = fl+(T M ) 

Re (^) i= 1 1 + £*=, Rf,-i(Mj) \1 + kR t _i J Rf (1I'm) 

Together with Lemma 5.3, this completes our proof of Theorem 5.2. 

Calculation of the decay rate. The WSM rate of our model on the infinite {k + l)-uniform 
(d + l)-regular hypertree is the same as the hardcore model on the infinite (d + l)-regular tree 
with activity kX. The WSM rate on regular tree has been addressed implicitly in the literature [18,32]. 
Here we provide an analysis for the decay rate for the completeness of the paper. 

Let fd,k(%) — denote the symmetric version of the tree recursion on and substituting 

x = kR. Since fd,k(%) is decreasing in x, it follows that there is a unique positive fixed point x such 
that x = fd,k(x). Let /^ fc (x) = —pyy be the derivative of fd,k{x) evaluated at the fixed point x = x. 
The following proposition is well known for hardcore model (see e.g. [18,32]). 


> 


(1 + kR+)R+ 


t- t-i 


(1 + kR ( ) R 


e+i 


_ 1 + kRj -A- 1 + kRj 
1 + kRj 1 + kRJ 

1 + kRg -j~r 1 + aikR^ 

~ 1 + a±kRJ “ 1 + aikRJ 


R e and 0 < ctj < 1. So (9) is proved, which finishes our 
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Proposition 5.4. \f' dk (£)| = < 1 if and only if A < A c . And \f d>k (x)\ <1 if A < A c . 

We write /(a;) = fd,k(x ) if k and d are clear in the context. The main result of this part is the 
following theorem. 

Theorem 5.5. For any positive integers d,k, assuming A < X c , the model on T k)dj exhibits weak 
spatial mixing with rate 6(1) such that for all sufficiently large t: 

• if A < X c , then 6(£) < C'i|/ , (x)|^~ 4 ; 

•if A = A C; f/ien <5(f) < ^=2^; 

where C±,C 2 ,(o > 0 are finite constants depending only on k,d and A. 

Theorem 4.1, Theorem 5.1 and 5.5 together prove the SSM part of Theorem 1.2. 

Denote g(x) = / ( f(x )) = A:A ^1 + (i+tp ) ■ B eas y to see that ® = s(£)- 

Lemma 5.6. If A < A c t/ien /or any x > x we have g(x) — g(x) < f'(x) 2 (x — x). 

Proof. By the mean value theorem, for any x > x, there exists a z £ [x. x] such that 


g(x) - g(x) = a(z)(x - x), 


( 10 ) 


where a(z) = g'(z ) = We will bound the maximum value of a(z) when A < A c . 

Consider the derivative of a(z), 


a'(z ) = A(z) (d — l)k\ — (1 + z)° 


where A(z) = — d ( d +i)fcAg(z) q Let z* = ((d — l)fcA) 1 / <i — 1 be the solution of (d — l)k\ = 

(1 + z) d . Note that [(d — l)kX — (1 + z) d ] is decreasing in z, therefore a(z) < a(z*) for all 
z > 0. Due to proposition 5.4, if A < A c then \f'(x)\ = < 1 and hence x < thus 

a'(x) = A(x)[(d — 1)&A — Q] < 0, which means x > z* and a(z) is decreasing in z for any z > x. 
On the other hand, we have a(x) = f(x) 2 . Thus for any z > x, we have a(z) < a(x) = f'(x) 2 . 
Plug it into (10). The lemma is proved. □ 

Proof of Theorem 5.5. It holds that R 2 = Rf = X > x/k. Note that kR^ = g(kRff_f). Due to the 
nronotonicity of g(x), we have x < kRf < kX for every i > 1. 

Consider the case A < A c . First consider the (k + l)-uniform d-ary hypertree T k ,d- By the mean 
value theorem and Lemma 5.6 we have 


kRff — x =g(kRg _ 2 ) — g(x) < f'(x) 2 (kRff_ 2 — x) . 

We apply this inequality recursively. Since kR^ — x < kX, for any f>2we have 

kRf -x < kX\f\x)\ £ - 2 . (11) 

To bound Rf we apply the mean value theorem again. There exists a z E [x, kR.f ] such that 

x - kRj = f(x) - f(kRj_f) = \ f(z)\(kRj_ 1 - x). 


21 









Since \f'(z)\ < kdX for all z > 0, combined with (11) we have 

x — kRj < kd\(kRj'_ 1 — x) < k 2 d\ 2 \f , (x)f~ 3 . 


At last, R.f — R £ = \(kR~£ — x + x — kR £ ) < C(|/'(x)| £ 3 for some C[ > 0 depending only on d, k 
and A. This only gives us the desired decay rate at the (k + l)-uniform d-ary hypertree Move 
to the (k + l)-uniform (d + l)-regular hypertree T k,d- The only difference is that the root has d + 1 
children instead of d. By the mean value theorem, this will multiply at most a finite constant factor 
C'( to the gap — RJ at the root of T^d, where C" > 0 depends only on d, k and A. Overall, this 
gives us that 

Pt - pI < Ri - RJ < Ci\f(x)t 4 

for some C\ > 0 depending only on d, k and A. This finishes the case that A < A c . 

Now we consider the critical case that A = A c = We start by considering the 

(A: +1)-uniform d-ary hypertree T^- It is easy to verify that in this case x = jTj-, a(x) = f'(x ) 2 = 1, 
z* = x and a'(x) = 0, where at(z) and z* are defined in the proof of Lemma 5.6. And we have 
a"(x) = — gy Taylor’s expansion for g(x) at the hxed point x = x, we have that for any 

constant c > 0 there exists a constant xq > x such that for any x < x < xq, it holds that 


g(x) = g{x) + a(x)(x — x) + Q ^ (x — x) 2 + Q ^ (x — x) 3 + o ((x — x) 3 ) 

2 6 

1 ,, (d + l)(d — l) 3 

< —-- + X — X --(x — x) + c(x — x) . 


d- 1 


6 d 2 


We define a sequence x\ = kRf,x 3 = kR% = g(x 1),... and generally X24+1 = g(x 2 t-i)- The 
sequence is strictly decreasing because R,f is decreasing in i. Furthermore, lim^oo X24+1 = x. This 
is due to a(x) < 1 for any x > x. 

Denote 624+1 — X24+1 — x. Let c be some positive constant such that ^ +1 g)p ^ -c > 0 . Denote 

j3 = ^ - c and 7 = There must be some sufficiently large to such that 624 0 +i < ^ 

and for any t > to, it holds that 


£ 24+3 - g{x2t+ 1 ) - ^ 1 < £ 24+1 - P<4t+ 1 - 


We apply an induction to complete the proof. For the basis, when t = to we have £24 0 +i < ^=- 
Assume the hypothesis 


£ 24+1 < 


7 

y/t — to T 2 


( 12 ) 


for some t > to and we will prove it holds for t + 1. First, notice that h(x) = x — fix 3 is strictly 
increasing when 0 < x < Thus, we have 


£ 24+3 < £ 24+1 - P^2t+1 < 


7 


-p 


7 


yjt — t 0 + 2 (£ — to + 2) 2 
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We only need to prove that u [ ,. — B --- 3 - < u [ ,, . Let t = t — to + 2. It is equivalently 

y/t-t 0 +2 (£_£ 0 +2)2 yi-io+3 

to show that 


Note that 


.3(1 

t! 2 ' 


\/p v'*' +1 


<^7 2 - 


(13) 


/ 3 

t's 


--— 

\\[t' v'i' + l 


= / Vt' + i-VF 

\ vWTI) , 

< Vt'(y/t' +1 - V^) 

1 


< x/i 7 


1 

< 

“ 2 


y/t' + 1 + Vtf 


y 2 = i 

conclusion, for any t > to, it holds that 


Since /Ly 2 = we just prove the inequality (13), and finishes the induction (12) for all t > to. In 


7 


kRtfj-i — x < - , _ 

27+1 - y/t - to + 2 


The rest of the proof is exactly the same as our proof of the case A < A c . 


ffl 


6 Approximation algorithms and inapproximability 

For 0 < e < 1, a value Z is an e-approximation of Z if (1 — e)Z < Z < (1 + e)Z. Recall that x is 
the unique fixed point solution to x = fd,k(x ) = LA(1 + x)~ d . 

Theorem 6.1. If A < A c = , then there exists an algorithm such that given any e > 0, 

and any hypergraph H of n vertices, of maximum edge-size at most (k + 1) and maximum degree at 
most {d + 1), the algorithm returns an e-approximation of the partition function for the independent 

sets of R with activity \, within running time , where k = In (qgr)- 

For the critical case where A = A r , there exists an algorithm that for the above R returns an 

e-a Ppm at,on of the Ignition function untk.n ru„nen 9 t.me n(kdf(^F). 

By duality, the same algorithm with the same approximation ratio and running time works for 
the matchings of hypergraphs of maximum edge size at most (d + 1) and maximum degree at most 
(k + 1). By Proposition 5.4, \f' dk {x)\ = yyA < 1 if A < A c , therefore, when A < A c , the running 
time of the algorithm is Poly(n, 1 ) for any bounded k and d, so the algorithm is an FPTAS for the 
partition function. And when A = A c , the algorithm is a PTAS for the log-partition function. The 
algorithmic part of the main theorem Theorem 1.2 is proved. 

In particular, when d = 1, the model becomes matchings of graphs of maximum degree (k+ 1), and 
the uniqueness condition A < A c (Tfc^) is always satisfied even for unbounded k since A c (Tfcj) = 00 . 
In this case, the fixed point x for /i,fc(x) = can be explicitly solved as x = ~ 1+v/ 2 1+4 —. We have 
the following corollary for matchings of graphs with unbounded maximum degree, which achieves 
the same bound as the algorithm in [1], 
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Corollary 6.2. There exists an algorithm which given any graph G of maximum degree at most A, 
and any e > 0, returns an e-approximation of the partition function for the matchings of G with 
activity A, within running time ^r^°( v/ '^ log 

With the construction of hypergraph self-avoiding walk tree and the SSM, the algorithm follows 
the framework by Weitz [34], We will describe an algorithm of approximating the partition function 
for independent sets in hypergraphs with activity A. Under duality this is the same as approximately 
counting matchings with activity A. 

By the standard self-reduction, approximately computing the partition function is reduced 
to approximately computing the marginal probabilities. Let TL = (V. E ) be a hypergraph and 
V = {ni,... ,v n }. To calculate Z = Zy{ A), it suffices to calculate the probability of the emptyset 
/i(0) as it is exactly 1 /Z. Let 0* be the configuration on vertices v\ up to Vi where all of them are 
unoccupied, and 1 the probability of Vi being occupied conditioning on all vertices v\ up to Uj_i 
being unoccupied. Then we have 1/Z = ]”[iLi(l — Pvf ') and logZ = — — Pvf -1 )- Note 

that (1 — Pu/ _1 ) > yyy for the probability of vertex unoccupied by an independent set and A c < 4 
for any d > 2 and k > 1. To get an e-approximation of Z, it suffices to approximate each of p?/ -1 
within an additive error ^(ITXyn • And to get an e-approximation of logZ, which can be obtained by 

getting an e-approximation of every — log(l — pf/ -1 ), it is sufficient to approximate each of p?/ -1 
within an additive error 0 

By Theorem 4.1, we have p a v = Pf- where T = 7 saw(%> v), i.e. the marginal probability of v 
being occupied is preserved in the SAW tree of TL expanded at v. And the value of Pf- can be 
computed by the tree recursion (3). To make the algorithm efficient we can run this recursion up to 
depth t and assume initial value 0 for the variables at depth t as the vertices they represent being 
unoccupied. The overall running time of the algorithm is clearly 0(n(kdY) where t is the depth of 
the recursion. By the strong spatial mixing guaranteed by Theorem 5.1 and Theorem 5.5, if A < A c , 

then the additive error of such estimation of pf, is bounded by C\ ■ ( AL I for some constant 

C\ > 0 depending only on k, d and A. We shall choose an integer t so that C\ ■ ( jfG 1 < 2 ( 1 +A)n > 

which gives us the suitable time complexity required by the FPTAS for the partition function. And 
when A = A c , the additive error of pf is bounded by Cijs/t — to for some constants Ci,t o > 0 
depending only on k,d. We shall choose an integer t = OfX^ln^) 2 ) to get the desirable additive 
error for every marginal probability, which gives us the PTAS for the log-partition function. This 
completes the proof of Theorem 6.1. 


Inapproximability. For the inapproximability, by applying an AP-reduction [3] from the inap- 
proximability of the hardcore model [9,31], we have the following theorem. 

Theorem 6.3. If A > --A c , there is no PRAS for the partition function or log-partition 

function of independent sets of hypergraphs with maximum degree at most d + 1, maximum edge-size 
at most k + 1 and activity X, unless NP=RP. 


Proof. The reduction is as described in [3], which is reduced from the hardcore model. Given 
a graph G(V,E) with maximum degree at most (d + 1), we construct a hypergraph TL(Vn, Ey) 
as follows. For each v € V, we create t = distinct vertices w Vj i, w Vi 2 , ■ • •, w Vj t and let 

Vy = {w Vy i \ v e V,1 < i < t}. And for every edge e = (u,v) G E, we create a hyperedge 
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Figure 4: The infinite hypergraph that achieves the uniqueness threshold 2fc+ ^ 1 1 - > A, 


S e = {vj Ut ],..., w U) t, w v> i ,..., w v j} and let E % = {S e | e £ E}. Clearly, the maximum degree of E 
is at most d + 1 and the maximum edge-size of E is at most 2t < k + 1. We define 

Z H ( A) = E A^ and Zg(X) = ^ A^L 

/: IS of n I: IS of G 

Note that by the above reduction every independent set / of G is naturally identified to distinct 
independent sets of hypergraph E such that a v £ V is occupied by / if and only if one of w v ^ is 
occupied by the corresponding independent set of E. Thus Zy^(X) = Zg(X') where X' = tX. 

Recall that G is an arbitrary graph of maximum degree at most d + 1 . According to Sly and 
Sun [31], when X' > ^yrfp+T) there exists a constant c such that unless NP=RP, the partition 
function Zg(X') can not be approximated within a factor of c n in polynomial time, which means 
there is no PRAS for the log-partition function logZ^A 7 ) when X' > , be. when A > 

d d _ 2fc+l+(-l) fe , 

L(fc+l)/2j(d-l) d + 1 “ k +1 Ac ’ 

□ 

The reduction in Theorem 6.3 transforms a hardcore model on a graph with maximum degree 
d+ 1 and activity 2fc+ ^ 1 — A to an instance of hypergraph independent sets with maximum degree 
at most d + 1, maximum edge-size at most k + 1, and activity A. In particular, it transforms the 
infinite ( d+ l)-regular tree i to the infinite 2|_(fc+ l)/2j-uniform hypergraph as shown in Figure 4. 

This infinite hypergraph has the uniqueness threshold ^^. +1 ) / / 2 f J(d-i) d + 1 = A -+1 ^ ^ c ' 

7 Local convergence of hypergraphs 

For the infinite ( k + l)-uniform ( d + l)-regular hypertree T^, a group G of automorphisms on T*. ^ 
classifies the vertices and hyperedges in into orbits (equivalent classes). We consider only G 
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with finitely many orbits. By Proposition 3.2, such group G can be uniquely identified by a pair 
of branching matrices ( D,K ) defined in Section 3 that classifies vertices and hyperedges in T k,d 
into finitely many types (labels), where the incidence relation between vertices and hyperedges with 
each type is specified by ( D , K ). We use T^ d to denote this resulting labeled hypertree. 

For a finite hypergraph T~i = ( V, E), we also consider the classification of vertices V = l±lie[r^] ^ 
and hyperedges E = l+J^r , Ej into disjoint types. 

Given a hypergraph T~L and a vertex v in T~L, write Bfiv) = Bu d {v) for the t-neighborhood around 
v in 77, that is, the sub-hypergraph induced by the vertices in T~L at distance at most t from v. For 
the labeled hypertree T® d , since once the type of the root is fixed the neighborhoods are identical 
(in terms of types), for each i £ [r„], we can denote T^ d (t, i) = Bx y t{v) where T = and v is any 
vertex in T of type-i. 

The following definition is inspired by those of [31] and [4] for spin systems. Intuitively, a 
sequence of finite structures locally resemble the infinite tree structure along with the suitable 
symmetry which exhibits the uniqueness/nonuniqueness phase transition at the critical threshold, 
so the measures on the sequence of finite structures may have local weak convergence to that on the 
infinite tree. The existence of such local convergence profoundly leads to several most important 
phase-transition-based inapproximability results [6,7,9,10,26,30,31] and is a key to the success of 
random regular bipartite graph as a gadget for anti-ferromagnetic spin systems. 

Definition 7.1 (local convergence). Let Tin = (V n ,E n ) be a sequence of random finite hypergraphs, 
whose vertices V n = V nd and hyperedges E n = l±J ?e [ Te ] E n j are classified into disjoint types, 

and for each i £ [ t v \, let I n . L £ V UtS denote a uniformly random vertex in V n of type-i. 

We say the LL n converge locally to T® d , and write Li n —h oc if for all t > 0 and i £ [t v \, 
Bt(I n ,i ) converges to T f d {t,i) in distribution with respect to the joint law P n of {Hu, In,i): that is, 

linr P n (Bt{I n ,i) = TTfcd(^)') = 

n —>oo \ ’ / 


where = denotes isomorphism which preserves vertex- and hyperedge-types and the incidence relation. 


Consider the natural uniform random walk on the incidence graph of and its projection onto 
the finitely many disjoint orbits (types) for vertices and hyperedges, which gives a (bipartite) finite 
Markov chain. It is quite amazing to see that the reversibility of this projected chain determines 
whether there exists a sequence of finite hypergraphs that converge locally to Tf d . 


Theorem 7.1. Let G be an automorphism group of with finitely many orbits for vertices and 
hyperedges. Let D and K be the branching matrices that corresponds to G as defined in Section 3. 
There is a sequence of random finite hypergraphs LL n —hoc d if and only if the Markov chain 


P = 


0 


l 

Lfc+l 


K 


o 


is time-reversible. 


We say a uniform random walk over a hypergraph TL is a uniform random walk on the incidence 
graph of Ti: that is, a random walk moves between vertices and hyperedges. Then the Markov chain 
P is the projection of the uniform random walk over T k,d onto the equivalent classes of vertices 
and hyperedges (i.e. the orbits of the automorphism group G that corresponds to the D and K). 


0 D 
K 0 

the (weighted) incidence relation between vertex- and hyperedge-types in the following way: each 


Meanwhile, matrix 


is the adjacent matrix for a directed bipartite graph that describes 
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directed bipartite edge from vertex-type-* to hyperedge-type-j (or vice versa) is assigned with weight 
dij (or kji). So the Markov chain P is also the random walk on this directed bipartite graph where 
the transition probability of each directed edge is proportional to its weight. 

For the bipartite Markov chain P , recall that due to Proposition 3.2, P must be irreducible. Then 
the time-reversibility of P is equivalent to the following: There exist positive vectors p = (pi)*e[r„l 
and q = ( ( Ij)je[ Te ] that satisfy the bipartite detailed balanced equation: 

Pidij = qj kji 


for every (i,j) G [t v ] x [r e ]. Without loss of generality, we assume YhiPi + Ylj Qj = -*-■ 

In fact, it is easy to check that pD = (k + T)q and qK = (d + 1 )p, therefore the p and q are 
respectively the left eigenvector of DK and KD both with eigenvalue (d+l)(/c-|-l). Since both DK 
and KD are irreducible, due to the Perron-Frobenius theorem, the only positive left eigenvectors p 
and q are the ones that are associated with the Perron-Frobenius eigenvalue (d + l)(/c + 1) and are 
one-dimensional. 

Furthermore, it must holds that jjljjj = . Denote p' = ^ | and q' = We have \\p’\\ = 

Ilg'H = 1 and Pijfii = for every (i,j) G [r„] x [r e ], i.e. p' is the vertex-stationary distribution 

and q' is the edge-stationary distribution. We will mostly use p and q in our proof of Theorem 7.1. 
Recall for the automorphism group G defined in Section 3 such that X c (T^ d ) = A C (T k t d) = k(d-i) d + 1 » 
i.e. the uniqueness of G-translation-invariant Gibbs measures on represents the uniqueness of all 


Gibbs measures on Tthe branching matrices are given as D 


1 

d 


d 

1 


and K = 


k 1 
1 k 


It is easy 


to verify that the resulting Markov chain P is not time-reversible. It then follows from Theorem 7.1 
that there does not exist any sequence of random finite hypergraphs that converge locally to T*. ^ 
with the symmetry G assumed by the extremal Gibbs measures p + , whose uniqueness represents 
the uniqueness of all Gibbs measures. 


Remark. Given branching matrices D and K , instead of considering T-L n that converges locally for 
every type to the T® d as in Definition 7.1, we can alternatively define a sequence 7~L n that converges 
locally in average to the d : that is, for all t> 0, the B t (I n ) converges to T^ d (t,I) in distribution, 
where I n denotes a uniformly random vertex in the finite hypergraph 7~L n , and I denotes a random 
vertex-type chosen according to the vertex-stationary distribution ff. This definition looks more 
analogous to the local convergence defined in [31] for the anti-ferromagnetic 2-spin system. But we 
will see the two definitions are equivalent: A sequence TL n —>i oc also converges locally to in 

average, since by double counting the portion of vertices of type-i must converge to p[ as n —> oo; 
and conversely, a sequence converges locally to in average must also haveH n —hoc simply 
because neighborhoods of vertices of different types cannot be isomorphic to each other. 


Proof of Theorem 7.1. We will prove the necessity of the reversibility of the chain by a double 
counting argument and the sufficiency is proved by explicitly constructing the sequence of the finite 
hypergraphs. 


Double counting. Let 7i n = ( V n ,E n ) where V n = 1+J seri V n , s and E n = [+J fer> E n ,t- Assume that 
B n —h oc 
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For T M sucF tFat tliere is a hypergraph sequence % n = (V n = l+J sgTV V njS , E n = l+| tgTe E n>t ) 
converging locally to T? we show that the Markov chain P is time reversible. The proof is by a 
double counting of the number of vertex-hyperedge pairs with specific type combination. 

Since the 1-neighborhood of the vertex with each type in T~L n converges in distribution to the 
1-neighborhood of the vertex with the same type in for sufficiently large n, we have all but a 
o(Infraction of vertices in 1-L n whose local transitions between vertex-types and hyperedge-types 
within 1-step are given precisely by D and K. Thus, for every (i, j) £ \t v ] x [r e ], the total number 
of incident vertex-hyperedge pair (v, e ) with v £ V n , t and e £ E n j (counted from the vertex-side 
and from the hyperedge-side) is given by 

dij(\V n} i\ + o(l)) = kji(\E n j\ + o(l)). 

As n —> oo, we will have (dij\V n ^\)/(kji\E n j\) —> 1, or equivalently 

| E n . j | dij 

I Vn,i | kji 

for all (i. j) £ [r„]x[r e ] such that dij, kji ^ 0. Thus there exists positive^, qj such that qj/pi = dij/kji 
for all such Since DK and KD are irreducible, we have unique corresponding positive left 

eigenvectors, which is (pi)*e[r„]> (Qj)je[T e ] here, such that pidij = qjkji for all (i,j). 


Construction of 'H n . Assume the Markov chain P in Theorem 7.1 to be time-reversible, and 
let p = (pi)ie[r„] and <f = (qj)je[T e ] he the unique positive vectors satisfying pAj = qjkji for every 
(i,j) £ [t v \ x [r e ] and YliPi + Qj = 1- The sequence of finite hypergraph sequence Jin that 
converges locally to T k.d is constructed as follows. The number n is approximately the total 
number of vertices and hyperedges in T-L n (where the approximation is due to rounding). 

• For each s £ [t„] and t £ [r e ], let V ntS be the set of | ~p s n\ vertices of type s, and E n j be 
the set of \qtn\ hyperedges of type t. We then describe hypergraphs 77 n = (V n ,E n ) where 
V n = l+J sgTij Vn )S and E n = [+l igTe E n j. 

• For each s £ [t v \ and t £ [r e ], let N s> t = \d s tp s n\ = \kt s qt , n\. Sample a uniformly random 
permutation / : [N s> t\ — > [lV S)t ], and create an incidence between the i-th vertex in V n ^ s and 
the j-th hyperedge in E n j for every (a, b = /(a)) with a £ i + \p s n\ Z and b £ j + [ ~qtn\ Z. 

Note that as normalized Perron eigenvectors for irreducible integer matrices, the p and q must 
be rational. Then there are infinitely many n such that N s t/\V ns \ = d s t and N Si t/\E n j\ = kf S . 
Without loss of generality, we can consider only these n, since otherwise it will contribute at most 
o(l)-fractions of bad neighborhoods. 

Viewing multi-edges in the incidence graph of 1-L n as different edges, it holds that each vertex 
of type-s is incident to exactly d s t hyperedges of type-f and each hyperedge of type-t is incident 
by exactly kt s vertices of type-s. Therefore it is sufficient to show that for any finite r > 0 the 
probability that the r-neighborhood of a vertex in 77 ri has no circle is 1 as n —> oo, i.e. almost 
surely the r-neighborhood of a vertex in T~L n is a hypertree. This can be proved easily by a standard 
routine of Galton-Watson branching process (see e.g. Ch. 9 in [15]) since the neighborhood is of 
constant size and the probability of reencountering a vertex or an edge from a population whose 
size goes to oo goes to 0. 

Tltf 
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